CDS

Accession Number TCMCG078C11942
gbkey CDS
Protein Id KAG0469813.1
Location join(10035086..10035207,10035949..10036042,10036200..10036264,10036362..10036573,10036663..10036726,10037166..10037238,10037366..10037486,10042694..10042781,10042861..10042986,10043059..10043168,10043256..10043404)
Organism Vanilla planifolia
locus_tag HPP92_016513

Protein

Length 407aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA633886, BioSample:SAMN14973820
db_source JADCNL010000008.1
Definition hypothetical protein HPP92_016513 [Vanilla planifolia]
Locus_tag HPP92_016513

EGGNOG-MAPPER Annotation

COG_category G
Description Belongs to the glycosyltransferase 31 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01003        [VIEW IN KEGG]
KEGG_ko ko:K20855        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGAATTGCAAGGGGAAAGGCGGAGGATTTGGGCAGCTGCAGGGAAGAAGTGTGGTACCCTCGAAATGGACACTAATGCTTTGCATTTTAAGCTTCTGCACTGGTCTTCTCTTCACAAATAGAATGTGGACATTGCCCAACACACATAATACAATTATACCGATTAGGAATCTTGGTGATAGGTCAGATTTACCTGGAGGTTGTCATTCGAAAATGATCAATGAAAATAGAGAACCTAAGGAGATTTCTGGAGACGCATCCAAAGCCACTTATGATATACACACACTAGATAAAACTATAGCAAATTTAGAAATGGAATTAGGAGCAGCGAGGGCCACGCAAGAGTCTATAATCAGTGGTTCCCCAGTATCAGATACCCTAAAATCCATGATATCTGGTGTAAGGCGGAAATATTTAATGGTTGTTGGTATCAACACTGCTTTTAGTAGCCGTAAGCGAAGAGATTCAGTTCGTGCTACTTGGATGCCTTCAGGTGAAAAAAGAAAGAAACTTGAAGAAGAAAAGGGGATCATCATCCGCTTTGTCATAGGCCATGGTGCAACATCCGGTGGTATTCTGGACAAAGCAATTGAAGCTGAGGATAGTAAACATGGGGATTTCATTAGGCTGGATCATGTTGAAGGTTACCTCGAGCTCTCAGCAAAGACCAAGGCATATTTTGCAACAGCTGTCAATGCATGGGATGCTGAATTCTATGTGAAGGTTGATGACGATGTACATGTAAATATAGGAACCCTTGCCGCTACACTTTCCAGGCACAGGTCAAAGCTTGGGGTGTACGTGGGGTGCATGAAGTCCGGCCCTGTCCTAGCTCAGAAGGGGGTGAGGTATCATGAACCCGAGTATTGGAAATTTGGTGAATATGGAAACAAATATTTCCGACATGCCACTGGCCAACTGTATGCAATTTCAAGGGACTTGGCCATTTACATATCCATAAACCAGCACGTACTACACAAGTATGCAAATGAGGATGTCTCTTTGGGAGCTTGGTTTATTGGATTGGATGTCGAACACATTGATGACCGTAGACTATGTTGTGGTACCCCACCTGACTGCGAGTGGAAGGCCCAAGCGGGCAACATCTGCGTTGCCTCGTTTGATTGGAGCTGCAGTGGGATTTGCAGGTCAGCCGAGAGGATGAAAGAGGTCCATCATCGCTGCGGTGAAGGCGAAAACCTTCTGTGGAATGCTGCATTTTAG
Protein:  
MNCKGKGGGFGQLQGRSVVPSKWTLMLCILSFCTGLLFTNRMWTLPNTHNTIIPIRNLGDRSDLPGGCHSKMINENREPKEISGDASKATYDIHTLDKTIANLEMELGAARATQESIISGSPVSDTLKSMISGVRRKYLMVVGINTAFSSRKRRDSVRATWMPSGEKRKKLEEEKGIIIRFVIGHGATSGGILDKAIEAEDSKHGDFIRLDHVEGYLELSAKTKAYFATAVNAWDAEFYVKVDDDVHVNIGTLAATLSRHRSKLGVYVGCMKSGPVLAQKGVRYHEPEYWKFGEYGNKYFRHATGQLYAISRDLAIYISINQHVLHKYANEDVSLGAWFIGLDVEHIDDRRLCCGTPPDCEWKAQAGNICVASFDWSCSGICRSAERMKEVHHRCGEGENLLWNAAF